ABSTRACT

Most recent progress in understanding the dynamical evolution of star clusters 
relies on direct N-body simulations. Owing to the computational demands, and the
desire to model more complex and more massive star clusters, hardware calculational 
accelerators, such as GRAPE special-purpose hardware or, more recently, GPUs (i.e. 
graphics cards), are generally utilised. In addition, simulations can be accelerated by 
adjusting parameters determining the calculation accuracy (i.e. changing the internal 
simulation time step used for each star). 

We extend our previous thorough comparison (Anders et al. 2009) of basic quantities
as derived from simulations performed either with STARLAB/kira or NBODY6.
Here we focus on differences arising from using different hardware accelerations
(including the increasingly popular graphic card accelerations/GPUs) and different
calculation accuracy settings.

We use the large number of star cluster models (for a fixed stellar mass function,
without stellar/binary evolution, primordial binaries, external tidal fields, etc.)
already used in the previous paper, evolve them with STARLAB/kira (and NBODY6, 
where required), analyse them in a consistent way and compare the averaged results 
quantitatively. For this quantitative comparison, we apply the bootstrap algorithm for 
functional dependencies developed in our previous study. 

In general we find very high comparability of the simulation results, independent
of the computer hardware used (including the hardware accelerators) and the
N-body code used. For the tested accuracy settings we find that for reduced accuracy (i.e.
time step at least a factor 2.5 larger than the standard setting) most simulation results
deviate significantly from the results using standard settings. The remaining deviations
are comprehensible and explicable.

Key words: Methods: N-body simulations, Methods: statistical, open clusters and 
associations: general 



1 INTRODUCTION 

In recent years, the modelling of star populations has advanced
strongly owing to stellar dynamical studies. The fields
covered comprise a wide range of research questions,
such as the modelling of individual star clusters (e.g.

The major codes used in this field are the family of
NBODYx codes (Aarseth 1999) and the STARLAB environment
with its N-body integrator kira (Portegies Zwart et al.
2001). Both codes are continuously expanded and improved,
including the ability to use hardware dedicated to improving
the simulation capabilities (in terms of simulation duration
and available population size). This includes advances in
software development (such as parallelisation, see e.g.
Harfst et al. 2007; Spurzem et al. 2008; Portegies Zwart et al.
2008) and the
computers/computer clusters
area networks, such as GRID

Additional N-body codes are emerging, such as MYRIAD
(Konstantinidis & Kokkotas 2010, GRAPE-enabled),
NBSymple (Capuzzo-Dolcetta et al. 2011, GPU-enabled)
and AMUSE (Portegies Zwart et al. 2009, a compilation
of various modules for gravitational dynamics, stellar evolution,
radiative transfer and hydrodynamics, which can be
combined independently). GPUs are also applied in
an additional wide range of astrophysical simulation environments
(Ford 2009; Schive et al. 2010; Thompson et al. 2010).
Their general availability and usability for computationally
intensive astrophysical modelling will continue the
rise of GPU usage.

Despite their extensive use and computational specialities
(see Sects. 2 and 4.2), a detailed study concerning
the reliability and comparability of the simulations performed is
missing. Important factors, which are studied in the present
work, include the use of various computer hardware (including
diverse special-purpose hardware to accelerate the simulations)
and simulation accuracy settings. These factors
determine the calculation accuracy and therefore
the ability to accurately trace fast, important phases of
star cluster evolution. Potentially, this can lead to incommensurable
simulation results, although starting configuration
and physical treatment are equivalent.

Sect. 2 describes the hardware and software used,
including their limitations in calculation accuracy. Sect. 3
summarizes the simulation and analysis methods used. In
Sect. 4 the simulation results are explained and analysed,
with a summary of all results and the conclusions drawn in Sect. 5.



2 UTILISED HARDWARE AND SOFTWARE 

We used 3 PCs for these simulations, whose specifications
are summarised in Table 1. The use of these 3 PCs was necessary
to test the sensitivity of the results to the
PC architecture and the availability of varying accelerator
hardware. The labels "PC1", "PC2" and "PC3" will be used
to indicate the use of solely the CPU, without the available
accelerator hardware. "PC1/GPU", "PC2/GRAPE"
and "PC3/GPU" indicate the usage of the available accelerator
hardware.



For the majority of tests we use the STARLAB software
package. Where we supplement these with simulations made
with NBODY we use NBODY6, contrary to our previous study
(Anders et al. 2009, hereafter "Paper I") where we used
NBODY4. NBODY6 significantly reduces the required
computational time owing to the AC neighbour scheme
(Ahmad & Cohen 1973 for the AC neighbour scheme and
Aarseth 2001 for the NBODY implementation).



2.1 Possible reasons for simulation differences 

Numerical modelling of N-body systems, such as star clusters,
usually involves long-term simulations. It is therefore
necessary to understand the influence of different errors
on the outcome. In the very basic case of a direct integration,
there are two contributions to the overall error:
one due to the time integration and one due to the
accuracy of the force calculations. In the ideal case these
two should be balanced, but in the majority of codes
(e.g. NBODY4, STARLAB, phiGRAPE, Harfst et al. 2008; Myriad,
Konstantinidis & Kokkotas 2010) a simple direct summation
method is employed to compute forces, whereas NBODY6
uses the Ahmad-Cohen neighbour scheme. While this, in
principle, should result in exact accelerations, in practice,
due to limited precision, several factors affect the result. For
example, the order in which partial forces are added affects
the accuracy: if the partial forces are added from the weakest
to the strongest, the total force on a particle is more accurate
than if the summation is done in reverse or
in arbitrary order. This is exacerbated if the precision of
the force accumulation is below a certain threshold which
depends on the system being modelled. On a CPU, a generic
double precision force loop usually adds up partial forces
in IEEE double precision arithmetic, which, it is safe to say,
results in the most accurate force. On special-purpose
accelerators (such as GRAPE or GPU), for performance
reasons, the summation is done in lower precision (e.g.
Makino et al. 2003; Nitadori 2009; Gaburov et al. 2009). For
example, Nitadori (2009) and Gaburov et al. (2009) use two
single precision numbers to emulate double precision, and
the latter authors conduct force accumulation in single
precision. The emulation of double precision is inherently less
precise (2x23 bits mantissa) compared to IEEE double
precision (53 bits mantissa), and its long-term errors are poorly
understood. This may potentially harm long-term
integrations.
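The effect of summation order at reduced precision can be illustrated with a minimal Python sketch. It is not taken from any of the codes discussed: the "partial forces" are a toy series falling off as 1/k^2, single precision is emulated by rounding through the standard `struct` module, and the helper names `to_f32` and `accumulate_f32` are hypothetical.

```python
import math
import struct


def to_f32(x):
    """Round a Python float (IEEE double) to the nearest IEEE single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def accumulate_f32(terms):
    """Add the terms one by one, rounding the running sum to single precision each time."""
    total = 0.0
    for t in terms:
        total = to_f32(total + to_f32(t))
    return total


# Toy "partial forces" (illustrative assumption): magnitudes falling off as 1/k^2.
partial = [1.0 / (k * k) for k in range(1, 20001)]

reference = math.fsum(partial)                                # accurate double-precision sum
weak_first = accumulate_f32(sorted(partial))                  # weakest -> strongest
strong_first = accumulate_f32(sorted(partial, reverse=True))  # strongest -> weakest

err_weak_first = abs(weak_first - reference)
err_strong_first = abs(strong_first - reference)
print(f"error, weakest first:   {err_weak_first:.3e}")
print(f"error, strongest first: {err_strong_first:.3e}")
```

In this toy case the strongest-first accumulation silently drops every contribution smaller than half a unit in the last place of the running sum, which is why its error is far larger than for the weakest-first order.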

In direct N-body simulations the error is usually due
to time integration (among other errors coming from binary
and multiple treatment, stellar evolution, close encounter
treatment) rather than force summation. Generally in N-body
codes, the time-step is defined by the Aarseth criterion
(Aarseth 2003), which is usually scaled by an "accuracy
parameter" η. The time-step size depends monotonically on
η, and changes in this parameter produce proportional changes
in the value of the time-step. In the generally employed 4th-order
Hermite scheme, the error decreases as the 4th order
of the time-step, and for a realistic time-step criterion
which provides a good balance between duration of simulations
and accuracy, this error remains dominant. Furthermore,
the time integration error has similar properties among
different codes due to its simpler nature.
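As a rough worked illustration of this scaling (a sketch only, assuming that the integration error ε of the 4th-order scheme scales purely as the fourth power of the time-step and that the time-step is proportional to η, as in the STARLAB convention), enlarging the accuracy parameter by a factor 2.5 would raise the integration error by

\[ \frac{\epsilon(2.5\,\eta)}{\epsilon(\eta)} \approx \left(\frac{2.5\,\Delta t}{\Delta t}\right)^{4} = 2.5^{4} \approx 39 . \]

The symbol ε is introduced here for illustration only and is not used elsewhere in the text.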

If η is increased above a certain value, the time integration
errors become intolerable: for example, close encounters
and dynamical binary integration are simulated inaccurately,
but consistently among different codes. If η is decreased
below a certain value, the dominant source of error
becomes force computation, which varies among different
implementations to a large degree. While this produces
more accurate results, the outcome from different codes may
show large scatter in sensitive quantities, such as the energy
conservation error, because possibly significantly different force
summation (or other) errors, rather than similar time integration
errors, influence the long-term simulations. In this paper
we explore a range of η parameters to identify their influence
on the simulation results.

Table 1. Specifications of PCs used

specification                PC1                     PC2                    PC3
Processor                    Quad Core Intel Q9300   Dual Core Intel Xeon   Dual Core AMD Opteron
Operating system             Debian 4.0              Red Hat 2.6.9-67       Debian 2.6.24.7
C compiler                   gcc 4.2                 gcc 3.4.6              gcc 4.3.2
Special hardware             GeForce 9800 GX2        GRAPE6-BLX64           GeForce 9800 GX2
Special hardware libraries   Sapporo v1.5            g6bx-0515              GPU module by K. Nitadori
N-body software              STARLAB 4.4.4           STARLAB 4.4.4          NBODY6



3 METHODS 

In this section we will summarize the methods developed 
in Paper I and applied in the present paper. For additional 
details see Paper I. 



3.1 Standard setup 

We used start models from the predecessor study. They were
created using STARLAB (and, where necessary, transformed
into the NBODY file format) and have the following characteristics:

• 1024 (1k) particles

• Salpeter (1955) stellar mass function, with the highest
mass = 10 x lowest mass

• no primordial binaries

• Plummer (1911) sphere initial density profile

There are 50 start models, with identical parameters
but varying statistical realisations.^2

All simulations were performed with the following settings:

• no stellar/binary star evolution

• no external tidal field

• duration of simulation: 1000 N-body time units (i.e.,
until well after core collapse, which occurs around 60 N-body
time units, see Paper I).

^2 The start models are available at
http://data.galev.org/nbody/NBODY_STARLAB_Comparison/
as "MF10 models" and at http://www.initialconditions.org/14.

Unless otherwise specified, the simulations were performed using

• hardware acceleration: GeForce 9800GX2 GPU

• calculation accuracy parameter η = η_starlab = 0.1 for
STARLAB, equivalent to η_nbody = 0.01 (due to a different
definition of the accuracy parameter in STARLAB and in
NBODY6, see Sect. 4.2). All quoted accuracy parameters are
given as η = η_starlab.


3.2 Analysis of N-body simulation snapshots

As we want to trace differential effects between our settings
(i.e., NBODY6 vs. STARLAB, solely CPU vs. CPU+GRAPE
vs. CPU+GPU, different accuracy parameters) we need to
ensure that output from different simulations is treated identically.
The main difficulty arises when comparing NBODY6
and STARLAB output, as the output content and format are
very different. E.g., NBODY6 output does not contain detailed
information about binaries, while STARLAB output
contains information on binary/triple/higher-order hierarchies.

We therefore reduce the format of each simulation out- 
put to the most basic one possible, only containing for each 
star its mass and vectors for its position and velocity. Bi- 
naries and higher order hierarchies are split up into their 
components. Other possibly present information is removed. 

We then split the series of output into individual snapshots
and pass them through kira, the N-body integrator of
the STARLAB package, which evolves them for 1/32
of an N-body time unit. This procedure rebuilds the
binary/higher-order hierarchy structure, and stores it accessibly
in the resulting snapshot. All snapshots with rebuilt
hierarchy structure are then analysed using tools^3 available
within the STARLAB package. In Paper I we investigated the
reliability of this procedure and did not find significant biases
(except for the spurious re-building of multiple systems
in the outer cluster regions, which are not the subject of the
present study). In addition, we calculate the change of total
energy per N-body time unit as a measure of the energy
conservation.
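The energy measure can be sketched in Python as follows, working on the reduced snapshot format (mass, position and velocity per star). This is an illustration under stated assumptions, not the analysis code used here: N-body units with G = 1 are assumed, no softening is applied, the change is taken as the plain difference between two snapshots divided by their time separation, and the function names are hypothetical.

```python
import math


def total_energy(masses, positions, velocities):
    """Kinetic plus potential energy of a snapshot in N-body units (assumes G = 1)."""
    kinetic = 0.0
    for m, v in zip(masses, velocities):
        kinetic += 0.5 * m * sum(c * c for c in v)
    potential = 0.0
    n = len(masses)
    for i in range(n):
        for j in range(i + 1, n):          # each pair counted once
            r = math.dist(positions[i], positions[j])
            potential -= masses[i] * masses[j] / r
    return kinetic + potential


def energy_change_per_time_unit(e_start, e_end, dt):
    """Change of total energy per N-body time unit between two snapshots dt apart."""
    return (e_end - e_start) / dt


# Two equal masses on a circular orbit with separation 1 (total mass 1).
masses = [0.5, 0.5]
positions = [(-0.5, 0.0, 0.0), (0.5, 0.0, 0.0)]
velocities = [(0.0, -0.5, 0.0), (0.0, 0.5, 0.0)]
e_binary = total_energy(masses, positions, velocities)
print(e_binary)
```

For a perfectly integrated isolated system the value returned by `energy_change_per_time_unit` would be zero, so its size serves as a simple accuracy diagnostic.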

^3 Specifically the HSYS_STATS program of STARLAB.

Results of this procedure are time series of cluster parameters
(like core radius, virial ratio, etc.) and binary parameters
(here only investigated at the end of the simulation;
as shown in Paper I testing at different times gives comparable
results but poorer statistics). For the further analysis,
results from the 50 individual runs per simulation setting are
combined: the lists of binary parameters are merged, while
for the overall cluster parameters a median time series and
the 16/84 per cent quantiles are calculated.
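Combining the individual runs into a median time series with 16/84 per cent quantiles can be sketched as below. The linear-interpolation convention in `quantile` and both function names are assumptions made for this illustration; the text does not specify which quantile estimator was used.

```python
def quantile(values, q):
    """Quantile (0 <= q <= 1) by linear interpolation between order statistics."""
    data = sorted(values)
    pos = q * (len(data) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    return data[lo] + (pos - lo) * (data[hi] - data[lo])


def combine_runs(runs):
    """runs: one equal-length time series per simulation run.

    Returns a list with one (16% quantile, median, 84% quantile) tuple per output time.
    """
    summary = []
    for values_at_t in zip(*runs):   # all runs at the same output time
        summary.append((quantile(values_at_t, 0.16),
                        quantile(values_at_t, 0.50),
                        quantile(values_at_t, 0.84)))
    return summary


# Five toy "runs" with two output times each.
runs = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0], [5.0, 10.0]]
summary = combine_runs(runs)
print(summary)
```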

3.3 Analyses of the simulation outcome 

In this section we summarize the methods used for
the analysis of the N-body data. Many of the analysis results
are given as probabilities. We will adopt the following (commonly
used) nomenclature when discussing the significance
level of statistical test results:

• highly significant: p-value < 1%

• significant: p-value < 5%

• weakly significant: p-value < 10%.

"Multiple testing": if we perform 100 independent tests,
a fraction of the order of 10% of p-values below 0.1 will arise
by chance even if none of the test null hypotheses is wrong.
As we analyse various star cluster parameters of a large number
of simulations, we need to take "multiple testing" into
account.
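The size of the "multiple testing" effect can be estimated with a short Python sketch. It assumes fully independent tests, which is an idealisation for correlated cluster parameters, and the function names are hypothetical.

```python
import math


def expected_chance_detections(n_tests, alpha):
    """Expected number of p-values below alpha when every null hypothesis is true."""
    return n_tests * alpha


def prob_at_least_k(n_tests, alpha, k):
    """Binomial probability that at least k of n independent tests give p < alpha by chance."""
    return sum(math.comb(n_tests, i) * alpha ** i * (1.0 - alpha) ** (n_tests - i)
               for i in range(k, n_tests + 1))


expected = expected_chance_detections(100, 0.10)   # about 10 "weakly significant" results
p_any = prob_at_least_k(100, 0.10, 1)              # chance of at least one such result
print(expected, p_any)
```

Comparing the observed number of deviations with `prob_at_least_k` gives a quick indication of whether they exceed what chance alone would produce.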



3.3.1 Binary and escaper parameters 

We will analyse the parameters of dynamically created bi- 
naries (semi-major axis, eccentricity, mass ratio, binding en- 
ergy) and the energy distributions of stars escaping from the 
cluster. 

Differences between two such distributions for two different
simulation setups are evaluated using a Kuiper test
(Kuiper 1962), an advanced KS test (for the KS test and
Kuiper test see, e.g., Numerical Recipes, Press et al. 1992).
A Kuiper test returns the probability that two distributions
are drawn from the same parent distribution (or, more accurately,
the probability that one is wrongly rejecting the null
hypothesis "The two distributions are drawn from the same
parent distribution.", which in most cases is equivalent).
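A two-sample Kuiper test can be sketched in Python as follows. The statistic is V = D+ + D-, and the p-value uses the asymptotic series given in Numerical Recipes; this is a minimal illustration with hypothetical function names, not necessarily the implementation used for the results in this paper.

```python
import math
from bisect import bisect_right


def kuiper_statistic(sample1, sample2):
    """Two-sample Kuiper statistic V = D+ + D- from the empirical distribution functions."""
    s1, s2 = sorted(sample1), sorted(sample2)
    n1, n2 = len(s1), len(s2)
    d_plus = d_minus = 0.0
    for x in s1 + s2:                      # the step functions only change at data points
        cdf1 = bisect_right(s1, x) / n1
        cdf2 = bisect_right(s2, x) / n2
        d_plus = max(d_plus, cdf1 - cdf2)
        d_minus = max(d_minus, cdf2 - cdf1)
    return d_plus + d_minus


def kuiper_pvalue(v, n1, n2, terms=100):
    """Asymptotic probability of a Kuiper statistic at least as large as v under the null."""
    n_eff = n1 * n2 / (n1 + n2)
    lam = (math.sqrt(n_eff) + 0.155 + 0.24 / math.sqrt(n_eff)) * v
    if lam < 0.4:                          # the series is unreliable here; p is ~1
        return 1.0
    total = 0.0
    for j in range(1, terms + 1):
        a = 2.0 * j * j * lam * lam
        total += (2.0 * a - 1.0) * math.exp(-a)   # (4 j^2 lam^2 - 1) exp(-2 j^2 lam^2)
    return min(1.0, max(0.0, 2.0 * total))


v_same = kuiper_statistic([1, 2, 3], [1, 2, 3])
v_disjoint = kuiper_statistic([1, 2, 3], [4, 5, 6])
print(v_same, v_disjoint, kuiper_pvalue(v_disjoint, 50, 50))
```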



3.3.2 Time series 

In Paper I we developed a bootstrap algorithm for func- 
tional dependencies and applied it to time series similar to 
the ones studied in the present work. For example, we want 
to quantitatively compare the time evolution of the core radius
(and of other parameters) for two different simulation
setups.

First, we introduce a measure of the difference between
two time series (or, more generally, between two functional
dependencies). Assume we have two functional dependencies
of one parameter on the independent variable x: y_1(x)
and y_2(x). For each x these dependencies have uncertainties
σ_1(x) and σ_2(x). The relative difference between the functional
dependencies at a given x is then:

\[ \delta_{12}(x) = \frac{y_1(x) - y_2(x)}{\sqrt{\sigma_1(x)^2 + \sigma_2(x)^2}} \qquad (1) \]

We then define the "difference between functions 1 and
2" as

\[ \Delta_{12} = \left| \frac{1}{N} \sum_{x} \delta_{12}(x) \right| \qquad (2) \]

where N is the number of datapoints used for the statistic.
We consider only the absolute value, as we want to have
a measure of the size of the difference, but not necessarily
its direction. In addition, this ensures Δ_12 = Δ_21.

Equivalently we define the "absolute difference between
functions 1 and 2" as

\[ \Gamma_{12} = \frac{1}{N} \sum_{x} \left| \delta_{12}(x) \right| \qquad (3) \]

While Δ_12 is more sensitive to systematic offsets, Γ_12
also traces statistical fluctuations.

We then utilise this measure of difference between two
time series to design a bootstrap-like algorithm to quantify
the significance of this difference.

We calculate 300 test clusters with STARLAB using the
standard settings (i.e., using PC1/GPU and with η=0.1),
and the same analysis routines as for the other clusters.

From these test clusters we randomly select sets of 50
clusters each (i.e. the number of clusters in the main simulations)
with replacement, and calculate for each parameter
the median y^T(x) and quantiles σ^T(x).

We build 2000 such sets. Out of those we randomly
select two sets (again with replacement) and derive the individual
values of Δ_12^T and Γ_12^T. We repeat this procedure
10000 times to estimate the Δ_12^T and Γ_12^T test distributions
for each parameter. As all test clusters are calculated with
the same settings, the Δ_12^T and Γ_12^T test distributions represent
the null hypothesis "functions 1 and 2 are drawn from
the same parent distribution". By comparing these test distributions
with the values Δ_12 and Γ_12 derived from the main simulations,
we can quantify the fraction of data in the test
distributions with Δ_12^T or Γ_12^T deviating more than the values
derived from the main simulations. This value
serves as a measure of how similar the two main simulations are.


In order to avoid applying the test statistic to highly
correlated data, which appear for the earliest timesteps (as
all runs share the same type of start models, only different
stochastic realisations) and which are beyond the area of
application of the test statistic, we start the summation in Eqs.
2 and 3 at 20 N-body time units. This is sufficiently well before
core collapse, which appears at about 60 N-body time
units (see Paper I), to capture the systems' behaviour during
this important epoch. As we have shown in Paper I, the
influence of varying this start time slightly is small and does
not change the conclusions drawn.



4 RESULTS 

In this section we will compare simulations made for a number
of specifications.

The quantitative test results are tabulated in Appendix B
and summarized in Sect. 5. Visual presentations of time
series for selected results are provided in Appendix A.



4.1 Different hardware, using STARLAB 

We have performed simulations on PC1 and PC2. For each
PC we realised one set of simulations with the installed
accelerator hardware and one set without hardware acceleration.

The results are visualised in Figs. A1 and A2. The
agreement appears to be very good, with possible exceptions
being the evolution of the kinetic energy (though with
large error bars; only the virial ratio is shown for consistency
with the other sections) and the cumulative eccentricity and
mass ratio distributions of the dynamically created binaries.
The quantitative measures (our bootstrap algorithm for
integrated cluster properties, and the Kuiper test for the
properties of binaries and escaping stars), shown in Tables B1 -
B4, generally confirm the good agreement. The following
issues are found:

• significant differences: kinetic energy of stars escaping
from the cluster for

#1.1 PC1 ⇔ PC1/GPU

and differences (visible in Fig. A2) in the mass ratio
distributions for

#1.1 PC1 ⇔ PC1/GPU

#1.2 PC1/GPU ⇔ PC2/GRAPE

• weakly significant: the Δ_12 test of King W_0 and the
potential energy for

#1.3 PC2 ⇔ PC2/GRAPE

• statistically insignificant: differences in the total kinetic
energy of all stars and the binary eccentricity distributions
(visible in Fig. A2)

Deviations in the mass ratio distributions can originate
from the stochastic nature of two-body interaction
processes. The number and level of the remaining deviations
is in agreement with "multiple testing", i.e. the occurrence
of statistical disagreement based on large numbers of tests.
Therefore, N-body simulations based on equivalent start
conditions and N-body code (here STARLAB) produce highly
comparable results, independent of the computer hardware
(including the accelerator hardware).



4.2 Accuracy settings 

Using PC1/GPU, we test the impact of the calculation
accuracy.

The "calculation accuracy" is determined by a dimensionless
constant η in the formula determining the integration
time step (see Aarseth 2003, Sect. 2.3)

\[ \Delta t_i = \sqrt{\eta_{\rm nbody}\, \frac{|F||F^{(2)}| + |F^{(1)}|^2}{|F^{(1)}||F^{(3)}| + |F^{(2)}|^2}} = \eta_{\rm starlab} \sqrt{\frac{|F||F^{(2)}| + |F^{(1)}|^2}{|F^{(1)}||F^{(3)}| + |F^{(2)}|^2}} \qquad (4) \]

where F is the force affecting particle i and F^(j) its j-th
time derivative.

Due to the slightly different definitions used by STARLAB
and NBODY, η_starlab = √η_nbody. In the remainder of the
paper we will express all accuracy settings as η = η_starlab.
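The relation between the two conventions can be checked numerically with a minimal Python sketch of the time-step formula above. The function names and the example vectors are hypothetical and chosen only for illustration; the sketch does not reproduce the additional time-step logic (e.g. block time-steps) of either code.

```python
import math


def norm(v):
    """Euclidean length of a 3-vector."""
    return math.sqrt(sum(c * c for c in v))


def aarseth_ratio(f, f1, f2, f3):
    """(|F||F2| + |F1|^2) / (|F1||F3| + |F2|^2) for the force and its first three derivatives."""
    return ((norm(f) * norm(f2) + norm(f1) ** 2)
            / (norm(f1) * norm(f3) + norm(f2) ** 2))


def dt_nbody(eta_nbody, f, f1, f2, f3):
    """NBODY convention: the accuracy parameter sits inside the square root."""
    return math.sqrt(eta_nbody * aarseth_ratio(f, f1, f2, f3))


def dt_starlab(eta_starlab, f, f1, f2, f3):
    """STARLAB convention: the accuracy parameter multiplies the square root."""
    return eta_starlab * math.sqrt(aarseth_ratio(f, f1, f2, f3))


# Arbitrary example values for the force and its derivatives (illustrative only).
f, f1, f2, f3 = (1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (0.0, 0.0, 3.0), (4.0, 0.0, 0.0)
step_nbody = dt_nbody(0.01, f, f1, f2, f3)
step_starlab = dt_starlab(0.1, f, f1, f2, f3)
print(step_nbody, step_starlab)
```

Both calls return the same time step, consistent with η_starlab = 0.1 being equivalent to η_nbody = 0.01.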

The results are displayed in Figs. A3 and A4.

In most panels, the simulations with the largest accuracy
parameter (i.e., the "worst" accuracy) studied are
clearly offset from the other simulations. The other simulations
seem to provide quite comparable results, except for
the change in total energy during 1 N-body time unit (i.e.,
a measure for energy conservation). The change in total
energy reduces with lowering the accuracy parameter η both
before and after core collapse, i.e. total energy becomes better
preserved when improving accuracy.

These findings are confirmed by the quantitative measures
from our tests:

• (highly) significant: most results for the η=0.3 and
η=0.4 runs strongly deviate from the results using the standard
settings, independent of the hardware used and the parameter
studied. For the η=0.25 runs, the results depend on the
parameter studied: some results follow the standard
setting results, while for the other parameters the results
deviate from the standard setting runs similarly to the
η=0.3 run results (no clear correlation between the parameter
studied and the deviation found)

• (highly) significant: the change in total energy during 1
N-body time unit (i.e. our measure for energy conservation)
deviates from the standard settings for almost all runs

• (highly) significant: additional individual deviations are
found, without a specific correlation with simulation settings,
hardware used or parameter studied.

The strongly deviating results for the η=0.3 and η=0.4 runs
(and partially the η=0.25 runs) strongly discourage
such "inaccurate" simulations.

The results for the energy conservation are expected. The
better the simulation accuracy, the better even slight effects
are traced. These slight effects, e.g. individual close encounters
or binary evolution, have a strong impact on the conservation
of the system's energy.

The number and level of additional deviations exceed
the expectations based on "multiple testing", even without
the η=0.3 and η=0.4 runs. The level of accuracy changes
the simulation results slightly but cumulatively; therefore
fundamental agreement for all studied parameters from simulations
using different accuracy settings is not assured.


4.3 STARLAB vs. NBODY6

We have performed simulations on PC1 and PC3. For each
PC we realised one set of simulations with the installed
GPUs and one set without hardware acceleration. The STARLAB
simulations were performed with η_starlab = 0.1, while
the NBODY6 simulations were performed with η_starlab ≈ 0.15
(η_i = √0.02 and η_r = √0.03). As shown in Sect. 4.2, these
settings give comparable results.

The results are shown in Figs. A5 and A6. They indicate
potential deviations in kinetic energy for the NBODY6
runs using PC3, in the change of total energy (during 1 N-body
time unit) between NBODY6 and STARLAB runs, and in
binary eccentricity for the GPU runs using either NBODY6
or STARLAB.

Our quantitative tests confirm the deviations for the
change in total energy during 1 N-body time unit: it is larger
(i.e. energy conservation is worse) for NBODY6 than
for STARLAB. This agrees with our earlier finding for
NBODY4 vs. STARLAB in Paper I.

The remaining expectations, based on visual examination
of Figs. A5 and A6, are not confirmed. Our statistical
tests indicate deviations in

• significant: the time evolution of r_max (i.e., the distance
of the furthest star from the cluster center) when comparing
simulations made with STARLAB and NBODY6, both without
hardware acceleration

• weakly significant: the binary mass ratio distribution
for #3.2 PC1 ⇔ PC3

• weakly significant: the distribution of the kinetic energy of
stars escaping the cluster for #3.1 PC1/GPU ⇔ PC3/GPU

These results are partially inconsistent with our previous
study of STARLAB vs. NBODY4 in Paper I. In Paper
I no significant deviations in r_max were found, the binary
mass ratio distributions were different (the probability of
both being drawn from the same parent distribution was
determined to be 14.9%, slightly larger than required for a
weakly significant deviation) and for the kinetic
energy of stars escaping the cluster no significant deviations
were found. The detected differences in the present work are
expected to originate from code differences between NBODY4
and NBODY6 (code enhancements, such as the AC neighbour
scheme) or between STARLAB version 4.4.2 and STARLAB version
4.4.4 (which included bug fixes).

The results for the time evolution of r_max and the binary
mass ratio distributions are based on processes with
strong statistical effects, as these quantities are affected by
rare events, e.g. direct two-body interaction processes. In
general, the number and level of deviations agree with the
expectations from "multiple testing". These results allow
reliable comparisons of NBODY6 and STARLAB simulations,
based on comparable input parameters and independent of
the computer hardware used.



5 SUMMARY OF RESULTS & CONCLUSIONS 

All of the quantitative test results are shown in Tables B1
- B4. They are based on 50 input models and their consistently
analysed output.

From these tables the following picture emerges:

• in general, the results obtained from N-body simulations
in various configurations are well comparable (with
some exceptions which are discussed below), independent
of the computer hardware, the accelerator type and
the N-body code used.

• results with too coarse accuracy (for our tests,
η_starlab = 0.3 and η_starlab = 0.4, partially η_starlab = 0.25)
give strongly and significantly deviating results in a variety
of quantities, hence should be avoided.

• the quantity which shows the highest sensitivity to the
configuration used is the energy conservation (here described
by the change of total cluster energy during one N-body
time unit). While it seems to be unaffected by the hardware
used, it changes strongly with the accuracy parameter
η, and between NBODY6 and STARLAB. The behaviour with
accuracy parameter is expected and desired. The differences
between NBODY6 and STARLAB are comparable to our earlier
finding when comparing NBODY4 and STARLAB in Paper I.

• the significant deviations found for some configurations
in the temporal evolution of r_max (distance of the furthest star
from the cluster center) and the mass ratio distribution of
binaries could be stochastic effects, as these quantities are
affected by rare events (e.g. close two-body encounters and
binary evolution). In our test setting, these quantities deviate
statistically significantly, though larger-scale simulations
are required to fundamentally check the statistical significance.

• the remaining significant deviations can be real or a result
of "multiple testing". The number of performed tests
and of observed deviations is consistent with resulting from
"multiple testing" for the study of hardware (#1.1-1.4 PC1
⇔ PC2, utilising or relinquishing the available accelerator
hardware) and of the N-body code used (#3.1-3.3 STARLAB
⇔ NBODY6). For the study of accuracy settings the number
and strength of deviations is higher than expected from
"multiple testing" for the energy-related cluster properties
and the binary properties.

To conclude: In general, direct N-body simulations are
assumed to be correct in a statistical sense rather than
on the level of individual trajectories. Except for a few
well-understood cases (such as the results for various accuracy
settings and the energy conservation), N-body simulations
using different hardware or N-body codes give very comparable
results, confirming the reliability in a statistical sense.
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Figure Al. Comparison of simulations using STARLAB and various hardware. The lines show the median values, the error bars give 
the uncertainty ranges from the 50 individual runs. Shown are the time evolutions of the core radius (top left, bottom lines), half-mass 
radius (top left, upper lines), the objects' mean mass in the core = measure for mass segregation (top right), virial ratio (bottom left) 
and change in total energy during 1 iV-body time unit (bottom right). 




Mass ratio m2/m1 Binding energy -E/kT 

Figure A2. Comparison of binary parameters from simulations using STARLAB and various hardware after 1000 ./V-body time units (i.e., 
well after core collapse). Shown are the cumulative distributions of the semi- major axis (top left), the eccentricity (top right), mass ratio 
of secondary to primary (bottom left) and the binding energy (bottom right). The lines show the data, the error bars give the uncertainty 
ranges from bootstrapping (at every 20 th data point only, for clarity). 
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Figure A3. Comparison of simulations using STARLAB and various accuracy settings, all using PCI and GPU. The lines show the median 
values, the error bars give the uncertainty ranges from the 50 individual runs. Shown are the time evolutions of the core radius (top left, 
bottom lines), half-mass radius (top left, upper lines), the objects' mean mass in the core = measure for mass segregation (top right), 
virial ratio (bottom left; 77=0.4 is not shown due to very large deviations and error bars) and change in total energy during 1 TV-body 
time unit (bottom right). 
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Figure A4. Comparison of binary parameters from simulations using STARLAB and various accuracy settings after 1000 TV-body time 
units (i.e., well after core collapse). All using PCI and GPU. Shown are the cumulative distributions of the semi-major axis (top left), 
the eccentricity (top right), mass ratio of secondary to primary (bottom left) and the binding energy (bottom right). The lines show the 
data, the error bars give the uncertainty ranges from bootstrapping (at every 20 th data point only, for clarity). 
Figure A5. Comparison of simulations using STARLAB or NBODY6, with either GPU acceleration or no hardware acceleration. The lines 
show the median values, the error bars give the uncertainty ranges from the 50 individual runs. Shown are the time evolutions of the 
core radius (top left, bottom lines), half-mass radius (top left, upper lines), the objects' mean mass in the core = measure for mass 
segregation (top right), virial ratio (bottom left) and change in total energy during 1 N-body time unit (bottom right). 
Figure A6. Comparison of binary parameters from simulations using STARLAB or NBODY6, with either GPU acceleration or no hardware 
acceleration, after 1000 N-body time units (i.e., well after core collapse). Shown are the cumulative distributions of the semi-major axis 
(top left), the eccentricity (top right), mass ratio of secondary to primary (bottom left) and the binding energy (bottom right). The lines 
show the data, the error bars give the uncertainty ranges from bootstrapping (at every 20th data point only, for clarity). 
Table B1. Bootstrap results for structural cluster properties. Given are the fractions (in %) of the test cluster distributions (which 
represent the case that they are drawn from the same parent distribution) that deviate more than the main simulations, i.e. the smaller 
this number, the less alike the distributions are. The "standard" setup uses PC1/GPU and an accuracy setting of η=0.1. Quantities are: 
r_core = cluster core radius, r_half = cluster half-mass radius, r_max = distance of the furthest star from the cluster center, <mass>_core = mean 
mass of objects in the core (i.e. a measure for mass segregation), King W0 = dimensionless potential W0 of a King profile (fitted by the 
HSYS-STATS analysis task of STARLAB), |density c.| = absolute distance of the cluster density center from the origin. For each parameter the 
probabilities from the A12 and T12 distributions (see Sect. 3.3.2) are given. If no η value is shown, η=0.1 is used. For a more detailed 
description of the setups see text. Notes: e denotes data sets with "extreme" uncertainties, i.e. "extreme run-to-run" scatter (applies 
only to the η=0.4 data sets). 



#ID   setup1            setup2                r_core          r_half          r_max           King W0         |density c.|    <mass>_core
                                                A12    T12      A12    T12      A12    T12      A12    T12      A12    T12      A12    T12

Tests using different PCs and accelerator hardware
1.1   PC1/GPU           PC1                   69.84  90.39    40.34  69.53    32.25  40.35    29.99  62.47    65.88  81.42    80.62  31.04
1.2   PC1/GPU           PC2/GRAPE             53.04  21.16    91.99  75.21    72.32  88.50    32.04  22.54    38.78  47.13    35.55  39.60
1.3   PC2/GRAPE         PC2                   31.78  42.99    35.33  69.37    18.52  21.18     5.76  33.24    67.62  74.21    48.73  72.74
1.4   PC1               PC2                   99.76  99.94    99.89  99.94    99.94  99.94    99.92  99.94    99.91  99.94    99.89  99.94

Tests using different accuracy settings η
2.1   standard          PC1/GPU η=0.05        90.53  78.08    33.26  47.85    45.83  54.34    80.62  56.33    16.17  16.73    63.13  56.13
2.2   standard          PC1/GPU η=0.2         24.23  62.85    84.23  59.07    59.08  69.36    22.56  37.74    39.55  46.59    39.16  30.37
2.3   standard          PC1/GPU η=0.3         16.76  11.23    51.91  22.42     6.67   8.25     3.68   3.74    48.51  61.55     5.93   9.22
2.4   standard          PC1/GPU η=0.4          0.10   2.24     0.76   0.79     [illegible] e   [illegible]     0.01   0.01 e   0.14   1.84
2.5   PC1/GPU η=0.05    PC1 η=0.05             [illegible]    12.97  17.48    84.45  73.63    90.32  33.38    22.55  16.88    71.13  54.56
2.6   PC1/GPU η=0.3     PC1 η=0.3             40.35  62.27    78.80  90.92     2.92   3.19    73.41  58.60    23.97  27.10    84.90  73.96
2.7   PC1/GPU η=0.4     PC1 η=0.4              5.76  17.15    30.84  54.66     4.56   4.55 e  83.53  27.20    12.03  14.27 e   2.26   5.77
2.8   PC1 η=0.1         PC1 η=0.025           38.52  89.72    62.60  85.86     6.54   4.90    17.27  63.72    85.57  72.98    33.01  58.43
2.9   PC1 η=0.1         PC1 η=0.05            83.51  72.57    16.69  29.78    65.07  53.41    35.25  52.36    81.28  74.73    53.60  76.91
2.10  PC1 η=0.1         PC1 η=0.3              7.73  65.07    63.79  88.17     0.18   0.25    10.86  34.68     1.58   1.84     2.82   4.32
2.11  PC1 η=0.1         PC1 η=0.4              6.12  41.94     0.78   0.79     0.45   0.58 e   2.54   7.09     7.90  10.02 e   1.75   6.19
2.12  PC2/GRAPE η=0.1   PC2/GRAPE η=0.05      12.53   4.09    79.69  85.86    41.02  41.70     5.48   5.79    84.51  91.26    61.87  15.14
2.13  PC2/GRAPE η=0.1   PC2/GRAPE η=0.2       60.32  70.07    64.11  91.51    45.93  62.23    31.19  55.00    82.72  92.35    40.92  91.06
2.14  PC2/GRAPE η=0.1   PC2/GRAPE η=0.25      36.41  22.06    52.11  37.57     2.40   2.86    12.73  25.07    77.08  86.30    34.87  22.32
2.15  PC2/GRAPE η=0.1   PC2/GRAPE η=0.3       98.73  27.86    82.25  91.92     0.05   0.09    84.11  54.87     5.04   6.38    29.73  58.32
2.16  PC2 η=0.1         PC2 η=0.025           35.15  88.64    64.30  84.28     3.94   2.72    13.60  59.63    77.85  51.25    28.70  49.31
2.17  PC2 η=0.1         PC2 η=0.05            83.47  72.21    16.73  29.86    65.07  53.41    35.19  52.63    81.35  74.71    53.64  77.51
2.18  PC2 η=0.1         PC2 η=0.2             36.16  76.20    43.41  50.17    65.45  87.24    41.31  61.96    88.75  94.54    89.62  55.47
2.20  PC2 η=0.1         PC2 η=0.25            41.90  81.57    11.63  17.46    20.53  17.35    10.52  34.10    72.97  27.49    83.06  46.80
2.21  PC2 η=0.1         PC2 η=0.3              7.65  64.17    63.92  88.13     0.18   0.25    10.53  33.50     1.58   1.84     2.82   4.43
2.22  PC2 η=0.1         PC2 η=0.4              2.34  13.15     0.79   0.80     0.00   0.00     1.84   3.30     0.00   0.00     1.45   5.21
2.23  PC1 η=0.025       PC2 η=0.025           99.85  99.94    99.89  99.94    99.94  99.94    99.77  99.94    99.91  99.94    99.81  99.94
2.24  PC1 η=0.05        PC2 η=0.05            99.82  99.94    99.94  99.94    99.94  99.94    99.87  99.94    99.88  99.94    99.94  99.94
2.25  PC1 η=0.3         PC2 η=0.3             99.69  99.94    99.89  99.94    99.94  99.94    98.39  99.94    99.94  99.94    99.87  99.94
2.26  PC1 η=0.4         PC2 η=0.4             55.01  99.94    62.69  99.91    13.21  16.31    65.91  99.94    30.13  37.76    83.58  99.94

Tests using STARLAB and NBODY6
3.1   standard          PC3/GPU               87.41  78.24    62.78  87.11    88.98  82.84    65.80  44.58    37.98  46.33    42.62  56.20
3.2   PC1               PC3                   94.11  74.08    81.33  90.53     1.05   1.29    41.68  29.64    69.88  87.79    53.41  18.48
3.3   PC3/GPU           PC3                   88.41  90.83    89.97  39.29    12.73  14.04    82.22  75.36    97.00  98.75    94.30  46.76
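
The meaning of the tabulated fractions ("percentage of same-parent test distributions that deviate more than the two main simulations") can be illustrated with a short Python sketch. It is only a sketch under stated assumptions: the distance measure `curve_distance` is a hypothetical stand-in and not the A12 or T12 statistic of the paper, and the test sets are built by randomly re-splitting the pooled runs (a permutation scheme) rather than by the bootstrap algorithm of the previous study.

```python
import random
from statistics import median


def median_curve(runs):
    """Median over runs at each output time; runs is a list of equal-length lists."""
    return [median(values) for values in zip(*runs)]


def curve_distance(runs_a, runs_b):
    """Hypothetical stand-in statistic: mean absolute difference of the median curves."""
    med_a, med_b = median_curve(runs_a), median_curve(runs_b)
    return sum(abs(x - y) for x, y in zip(med_a, med_b)) / len(med_a)


def fraction_more_deviating(runs_a, runs_b, n_test=1000, seed=0):
    """Percentage of same-parent test splits that deviate more than the observed pair.

    Small values indicate that the two setups are unlikely to share a parent
    distribution; values near 100 indicate nearly identical results.
    """
    rng = random.Random(seed)
    observed = curve_distance(runs_a, runs_b)
    pooled = list(runs_a) + list(runs_b)
    n_a = len(runs_a)
    more = 0
    for _ in range(n_test):
        rng.shuffle(pooled)  # random re-split: both halves share one parent set
        if curve_distance(pooled[:n_a], pooled[n_a:]) > observed:
            more += 1
    return 100.0 * more / n_test
```

With 50 runs per setup, `runs_a` and `runs_b` would each hold 50 time series of one quantity (for example the core radius).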
Table B2. Bootstrap results for energy-related cluster properties. Given are the fractions (in %) of the test cluster distributions (which 
represent the case that they are drawn from the same parent distribution) that deviate more than the main simulations, i.e. the smaller 
this number, the less alike the distributions are. The "standard" setup uses PC1/GPU and an accuracy setting of η=0.1. Quantities are: 
E_pot = cluster potential energy, E_kin = cluster kinetic energy, Q_vir = cluster virial ratio, E_tot = cluster total energy, δE_tot = change 
in cluster total energy during 1 N-body time unit. For each parameter the probabilities from the A12 and T12 distributions (see Sect. 
3.3.2) are given. Note: e denotes data sets with "extreme" uncertainties, i.e. "extreme run-to-run" scatter (applies only to the η=0.4 
data sets). s denotes data sets with strong "spikes" / "wiggles" in the median parameter evolution, compared to other data sets in the 
same category. 



#ID   setup1            setup2                E_pot           E_kin           Q_vir           E_tot           δE_tot
                                                A12    T12      A12    T12      A12    T12      A12    T12      A12    T12

Tests using different PCs and accelerator hardware
1.1   PC1/GPU           PC1                   36.33  74.09    78.07  94.07    98.78  85.97    73.78  79.77    10.97  67.01
1.2   PC1/GPU           PC2/GRAPE             57.97  99.21    38.76  43.44    41.61  39.07    67.69  53.08    12.28  39.33
1.3   PC2/GRAPE         PC2                    9.13  24.42    25.44  28.05    42.35  43.88    31.91  34.91    41.07  95.35
1.4   PC1               PC2                   78.35  99.94    99.91  99.94    99.94  99.94    83.79  99.94    36.01  99.86

Tests using different accuracy settings η
2.1   standard          PC1/GPU η=0.05         [illegible]    35.54  39.81    48.15  49.59     0.79   0.71     0.80   0.80
2.2   standard          PC1/GPU η=0.2         77.95  83.90    69.85  79.04    88.87  80.26    81.35  86.65     1.07   1.87
2.3   standard          PC1/GPU η=0.3         19.04   7.42     0.36   0.40     0.79   0.85     1.16   1.26     0.80   0.80
2.4   standard          PC1/GPU η=0.4          0.83   0.90    58.73  78.17 e  61.43  95.12 e  86.74  99.94 e   0.68   0.80
2.5   PC1/GPU η=0.05    PC1 η=0.05            15.92   0.80    88.13  61.33    87.25  94.55     0.44   0.36     0.80   0.80
2.6   PC1/GPU η=0.3     PC1 η=0.3             33.84  24.99    50.99  26.35    27.13  16.48    60.81  58.50    13.87   1.73
2.7   PC1/GPU η=0.4     PC1 η=0.4              0.99   1.06    84.21  99.94 e  85.30  99.94 e  95.44  99.94 e  16.38   2.15
2.8   PC1 η=0.1         PC1 η=0.025           54.62  73.73    32.20  37.03    45.17  50.23    35.63  42.21     6.65  78.87
2.9   PC1 η=0.1         PC1 η=0.05            19.24  45.50    41.95  43.63    37.25  39.30    40.89  36.31     9.72  96.04
2.10  PC1 η=0.1         PC1 η=0.3             56.43  33.79     0.26   0.32     0.63   0.67     1.28   1.39     0.80   0.80
2.11  PC1 η=0.1         PC1 η=0.4              7.82   1.39    86.25  99.94 e  87.80  99.94 e  95.09  99.94 e   0.78   0.80
2.12  PC2/GRAPE η=0.1   PC2/GRAPE η=0.05      54.20  40.15     6.29   6.41    13.68  11.89    19.97  12.78     4.32  21.91
2.13  PC2/GRAPE η=0.1   PC2/GRAPE η=0.2       41.02  77.53    13.73  14.83    10.71  10.76    18.60  18.18     0.80   0.88
2.14  PC2/GRAPE η=0.1   PC2/GRAPE η=0.25      29.41  25.22     0.36   0.38     0.30   0.30     1.35   1.47     0.00   0.00
2.15  PC2/GRAPE η=0.1   PC2/GRAPE η=0.3       99.94  99.94     0.06   0.06     0.58   0.62     0.01   0.01     0.00   0.00
2.16  PC2 η=0.1         PC2 η=0.025           42.86  74.61    29.10  34.59    40.68  46.22    24.94  30.01     0.78  99.94
2.17  PC2 η=0.1         PC2 η=0.05            12.26  22.60    41.97  43.63    37.21  39.24    47.95  40.23     4.93  67.99
2.18  PC2 η=0.1         PC2 η=0.2             55.86  61.63    38.84  42.36    49.19  56.02    48.23  48.72     0.80   0.81
2.20  PC2 η=0.1         PC2 η=0.25            10.10  25.20    12.12  12.31    24.05  24.53    41.47  41.30     0.00   0.00
2.21  PC2 η=0.1         PC2 η=0.3             40.47  33.49     0.26   0.32     0.63   0.67     1.50   1.61     0.78   0.80
2.22  PC2 η=0.1         PC2 η=0.4              3.81   0.96     0.00   0.00     0.00   0.00     0.49   0.75     0.00   0.00
2.23  PC1 η=0.025       PC2 η=0.025           99.84  99.94    99.91  99.94    99.93  99.94    99.88  99.94     0.73  99.94
2.24  PC1 η=0.05        PC2 η=0.05            99.72  99.94    99.90  99.94    99.79  99.94    99.78  99.94     5.98  89.68
2.25  PC1 η=0.3         PC2 η=0.3             99.75  99.94    99.91  99.94    99.92  99.94    99.94  99.94    74.42  99.94
2.26  PC1 η=0.4         PC2 η=0.4             90.92  99.87    90.53  99.94    91.68  99.94    96.84  99.94     0.88   1.10

Tests using STARLAB and NBODY6
3.1   standard          PC3/GPU               57.81  93.14    67.28  75.89    89.47  93.85    65.10  70.36     0.80   0.82
3.2   PC1               PC3                   33.04  62.02    15.09  16.65    25.11  29.54    16.68  18.97     0.80   0.81
3.3   PC3/GPU           PC3                   53.66  54.56    11.52  12.50    30.14  35.33    19.37  21.52     7.22   2.87
Table B3. Kuiper test results for binary properties. Given is the probability (in %) that for the given setups the binary properties 
are drawn from the same distribution. The "standard" setup uses PC1/GPU and an accuracy setting of η=0.1. Quantities are: axis 
= semi-major axis, ecc = eccentricity, E/kT = binding energy in E/kT, m2/m1 = mass ratio secondary mass / primary mass. #X = 
number of binaries in simulation X. 



#ID   setup1            setup2                        #1          #2        axis         ecc        E/kT       m2/m1

Tests using different PCs and accelerator hardware
1.1   PC1/GPU           PC1                          201         202       83.57       83.99       60.39        4.99
1.2   PC1/GPU           PC2/GRAPE                    201         206       80.18       32.28       69.14        1.63
1.3   PC2/GRAPE         PC2                          206         202       99.97       13.67       49.19       76.06
1.4   PC1               PC2                          202         202      100.00      100.00      100.00      100.00

Tests using different accuracy settings η
2.1   standard          PC1/GPU η=0.05               201         190 [illegible] [illegible] [illegible] [illegible]
2.2   standard          PC1/GPU η=0.2                201         213       97.26       72.84       90.72       44.90
2.3   standard          PC1/GPU η=0.3                201         213        9.20       51.32        0.81       28.47
2.4   standard          PC1/GPU η=0.4                201         253        0.00       65.53        0.00       51.58
2.5   PC1/GPU η=0.05    PC1 η=0.05                   190         202       29.28       77.83       53.79       33.28
2.6   PC1/GPU η=0.3     PC1 η=0.3                    213         220 [illegible] [illegible] [illegible] [illegible]
2.7   PC1/GPU η=0.4     PC1 η=0.4                    253         260 [illegible] [illegible] [illegible] [illegible]
2.8   PC1 η=0.1         PC1 η=0.025                  202         204       94.15        6.24       82.66       28.19
2.9   PC1 η=0.1         PC1 η=0.05                   202         202 [illegible] [illegible] [illegible] [illegible]
2.10  PC1 η=0.1         PC1 η=0.3                    202         220 [illegible] [illegible] [illegible] [illegible]
2.11  PC1 η=0.1         PC1 η=0.4                    202 [illegible] [illegible] [illegible] [illegible] [illegible]
2.12  PC2/GRAPE η=0.1   PC2/GRAPE η=0.05             206         191       35.85       92.07       47.82       27.43
2.13  PC2/GRAPE η=0.1   PC2/GRAPE η=0.2              206         204       43.81       64.46       42.24       17.05
2.14  PC2/GRAPE η=0.1   PC2/GRAPE η=0.25             206         198       28.62       75.40       26.71       21.53
2.15  PC2/GRAPE η=0.1   PC2/GRAPE η=0.3              206         234       48.59        4.71        5.57        4.23
2.16  PC2 η=0.1         PC2 η=0.025                  202         204       93.99        8.03       82.66       28.19
2.17  PC2 η=0.1         PC2 η=0.05                   202         201       20.43       37.05       58.81        2.33
2.18  PC2 η=0.1         PC2 η=0.2                    202         204       64.77        3.43       41.88       33.95
2.20  PC2 η=0.1         PC2 η=0.25                   202         202       66.44       30.82       73.82        0.27
2.21  PC2 η=0.1         PC2 η=0.3                    202         220       20.24       18.05        1.10        3.38
2.22  PC2 η=0.1         PC2 η=0.4                    202         248        0.03       54.16        0.00       19.66
2.23  PC1 η=0.025       PC2 η=0.025                  204         204      100.00      100.00      100.00      100.00
2.24  PC1 η=0.05        PC2 η=0.05                   202         202      100.00      100.00      100.00      100.00
2.25  PC1 η=0.3         PC2 η=0.3                    220         220      100.00      100.00      100.00      100.00
2.26  PC1 η=0.4         PC2 η=0.4                    260         248       93.34       99.88        0.22       99.97

Tests using STARLAB and NBODY6
3.1   standard          PC3/GPU                      201         189       48.81       12.63       54.46       76.14
3.2   PC1               PC3                          202         209       46.91       34.27       74.56        7.80
3.3   PC3/GPU           PC3                          189         209       22.69       43.63       80.84       49.64
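
For orientation, a two-sample Kuiper test of the kind summarised in this table can be sketched in a few lines of Python. This is a minimal illustration, not the code used for the paper: the function names are hypothetical, and the significance is assumed to follow the standard asymptotic series with the usual effective-sample-size correction.

```python
import math


def kuiper_statistic(a, b):
    """Two-sample Kuiper statistic V = D+ + D-.

    D+ and D- are the largest positive and negative differences
    between the two empirical cumulative distributions.
    """
    a, b = sorted(a), sorted(b)
    na, nb = len(a), len(b)
    i = j = 0
    d_plus = d_minus = 0.0
    while i < na and j < nb:
        x = min(a[i], b[j])
        while i < na and a[i] <= x:
            i += 1
        while j < nb and b[j] <= x:
            j += 1
        diff = i / na - j / nb
        d_plus = max(d_plus, diff)
        d_minus = max(d_minus, -diff)
    return d_plus + d_minus


def kuiper_probability(v, na, nb, terms=100):
    """Asymptotic probability (0..1) that both samples share one parent distribution.

    Assumes Q(lam) = 2 * sum_j (4 j^2 lam^2 - 1) exp(-2 j^2 lam^2) with
    lam = (sqrt(Ne) + 0.155 + 0.24 / sqrt(Ne)) * V and Ne = na * nb / (na + nb).
    """
    ne = na * nb / (na + nb)
    lam = (math.sqrt(ne) + 0.155 + 0.24 / math.sqrt(ne)) * v
    if lam < 0.4:
        return 1.0  # the series is not useful here; the probability is effectively 1
    total = 0.0
    for j in range(1, terms + 1):
        t = 2.0 * (j * lam) ** 2
        total += (2.0 * t - 1.0) * math.exp(-t)
    return min(1.0, max(0.0, 2.0 * total))
```

Multiplying the returned probability by 100 gives values on the scale used in the table. For two identical samples the statistic is zero and the probability is 100%, which is consistent with the 100.00 entries in the rows comparing PC1 and PC2.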
Table B4. Kuiper test results for energy-related properties of unbound stars at 1000 N-body time units (i.e. well after core collapse). 
Results with a "B" superscript denote results for unbound binary systems. Given is the probability (in %) that for the given setups the 
properties are drawn from the same distribution. The "standard" setup uses PC1/GPU acceleration and an accuracy setting of η=0.1. 
Quantities are: E_kin = reduced kinetic energy, E_pot = reduced potential energy, E_tot = reduced total energy. #X = number of systems 
in simulation X. 



#ID   setup1            setup2                        #1          #2       E_kin       E_pot       E_tot        #1^B        #2^B     E_kin^B     E_pot^B     E_tot^B

Tests using different PCs and accelerator hardware
1.1   PC1/GPU           PC1                         7554        7441        4.71       31.42       13.47         144         150       96.68       94.73       96.46
1.2   PC1/GPU           PC2/GRAPE                   7554        7518       11.47       11.08       34.37         144         142       86.53       87.59       86.53
1.3   PC2/GRAPE         PC2                         7518        7441       49.20       73.55       59.18         142         150       84.42       62.13       91.79
1.4   PC1               PC2                         7441        7441      100.00      100.00      100.00         150         150      100.00      100.00      100.00

Tests using different accuracy settings η
2.1   standard          PC1/GPU η=0.05              7554        7592       30.86       16.44       62.53         144         144       86.44       92.10       86.44
2.2   standard          PC1/GPU η=0.2               7554        7640       21.75       16.36       34.70         144         154       86.56       36.04       74.98
2.3   standard          PC1/GPU η=0.3               7554        8199        0.00        0.00        0.00         144         100       88.99       67.80       93.17
2.4   standard          PC1/GPU η=0.4               7554       10052        0.00        0.00        0.00         144         209       15.77       35.88       30.41
2.5   PC1/GPU η=0.05    PC1 η=0.05                  7592        7557 [illegible]       88.07       87.85 [illegible]         136 [illegible] [illegible]       17.95
2.6   PC1/GPU η=0.3     PC1 η=0.3                   8199        7364        0.00        0.00        0.00         100         169       64.47       71.74       47.64
2.7   PC1/GPU η=0.4     PC1 η=0.4                  10052        9567        0.00        0.11        0.00         209         197       82.27       32.84       74.65
2.8   PC1 η=0.1         PC1 η=0.025                 7441        7659       36.43       55.25       16.48         150         148       60.58       32.59       69.61
2.9   PC1 η=0.1         PC1 η=0.05                  7441 [illegible]       16.38 [illegible] [illegible]         150         136       65.41       91.57       65.41
2.10  PC1 η=0.1         PC1 η=0.3                   7441        7364        3.71        7.26       12.84         150         169       54.09       84.83       72.21
2.11  PC1 η=0.1         PC1 η=0.4                   7441        9567        0.00        0.00        0.00         150         197       15.87        6.38       15.87
2.12  PC2/GRAPE η=0.1   PC2/GRAPE η=0.05            7518        7405       98.19       50.21       97.57         142         134       99.92       67.48       99.96
2.13  PC2/GRAPE η=0.1   PC2/GRAPE η=0.2             7518        7493       71.42       80.02       74.28         142         151       49.01       49.97       49.01
2.14  PC2/GRAPE η=0.1   PC2/GRAPE η=0.25            7518        7366       41.45       88.32       48.86         142         149       56.28       46.16       62.32
2.15  PC2/GRAPE η=0.1   PC2/GRAPE η=0.3             7518        7254        8.61       22.26        5.48         142         161       30.29       23.97       30.29
2.16  PC2 η=0.1         PC2 η=0.025                 7441        7659       36.43       55.25       17.08         150         148       60.58       32.59       69.61
2.17  PC2 η=0.1         PC2 η=0.05                  7441        7557       16.38       64.68       22.13         150         136       65.41       91.57       65.41
2.18  PC2 η=0.1         PC2 η=0.2                   7441        7414        3.52       38.18        5.98         150         145       80.71       53.26       73.67
2.20  PC2 η=0.1         PC2 η=0.25                  7441        7497        6.38       84.60       24.08         150         158       99.50       92.74       99.91
2.21  PC2 η=0.1         PC2 η=0.3                   7441        7364        3.71        7.26       12.84         150         169       54.09       84.83       72.21
2.22  PC2 η=0.1         PC2 η=0.4                   7441        7398        0.00        0.00        0.00         150         190        2.74        6.40        2.74
2.23  PC1 η=0.025       PC2 η=0.025                 7659        7659      100.00      100.00      100.00         148         148      100.00      100.00      100.00
2.24  PC1 η=0.05        PC2 η=0.05                  7557        7557      100.00      100.00      100.00         136         136      100.00      100.00      100.00
2.25  PC1 η=0.3         PC2 η=0.3                   7364        7364      100.00      100.00      100.00         169         169      100.00      100.00      100.00
2.26  PC1 η=0.4         PC2 η=0.4                   9567        7398        0.00        0.00        0.00         197         190       99.91       91.22       99.99

Tests using STARLAB and NBODY6
3.1   standard          PC3/GPU                     7554        7501        9.59       48.69       21.44         144         135       99.84       99.22       99.94
3.2   PC1               PC3                         7441        7507       81.64       17.46       63.79         150         140       92.75       71.24       87.68
3.3   PC3/GPU           PC3                         7501        7507       72.60       37.09       82.08         135         140       79.35       29.11       85.84